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(54) Storage format conversion of computer graphics 



(57) An image processor converts single-band pixel 
components, each of which represents a single band of 
a multiple-band pixel, to multiple-band pixels. Data 
words t which include multiple single-band pixel compo- 
nents corresponding to first and third bands cf two or 
more multiple-band pixels, are merged to form an inter- 
leaved data word in which respective single-band pixel 
components of the merged data words are interleaved. 
Data words, which include multiple single-band pixel 
components corresponding to second and fourth bands 
of the two or more multiple-band pixels, are merged to 
form a second interleaved data word in which respective 
single-band pixel components of the merged data words 
are interleaved. The first-mentioned interleaved data 
word and the second interleaved data word are then 
merged to form a third interleaved data word in which 
respectively single-band pixel components of the first 
and second interleaved data word are interleaved within 
one another. In one embodiment, a single read opera- 
tion reads four single-band pixel components from each 
of three buffers which correspond to red, green, and 
blue bands, respectively, of a multiple-band graphical 
image. A single merge operation merges eight single- 
band pixel components representing alpha and green 
bands of four multiple-band pixels, and a single merge 
operation merges eight single-band pixel components 
representing blue and red bands of four multiple-band 
pixels. Two merge operations merge the respective 
merged data words to form four multiple-band pixels, 
each of which includes alpha, blue, green, and red com- 



ponents. The four multiple-band pixels are written to a 
destination buffer in four write operations. 
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Description multiple bands of the graphical image. 



FIELD OF THE INVENTION 

The present invention relates to graphical image 
processing in a computer system and, in particular, to a 
particularly efficient mechanism for combining multiple 
separate bands of a graphical image into a single; mul- 
tiple-band image buffer. 

BACKGROUND OF THE INVENTION 

In most computer graphics display devices in use 
today, color graphical images to be displayed must be 
in a 4-band t interleaved format in which four contiguous 
data components specify four respective components 
of a single pixel of the graphical image. For example, in 
some such devices, four contiguous bytes of data spec- 
ify alpha, blue, green and red components, respectively, 
of a single pixel. It is also common to store a graphical 
image as multiple, separate pixel arrays for each com- 
ponent of which the graphical image is composed. For 
example, a color graphical image may be stored in the 
memory of a computer as three separate buffers, the 
first buffer including red components of the pixels of the 
graphical image, the second buffer including green com- 
ponents of the pixels of the graphical image, and the 
third buffer including blue components of the pixels of 
the graphical image. 

To display a graphical image, each band of which 
is stored in a separate buffer, it is generally necessary 
to interleave the respective bands of the respective buff- 
ers into a single buffer of multiple-band pixels. For ex- 
ample, one such multiple-band pixel format is the gen- 
eral ABGR format which is generally alpha, blue, green, 
and red components of a single pixel stored as four re- 
spective, contiguous bytes. It is common for graphical 
images produced today to include approximately one 
million pixels. For example, common sizes for graphical 
images include rectangular grids of 1024-by-768 pixels 
or 12B0-by-1024 pixels, i.e., 786,432 and 1,31 0,720 nix- 
els, respectively. To produce from three separate buffers 
of respective bands of a graphical image a single buffer 
containing a four-band graphical image typically re- 
quires approximately three million read operations to 
read each component of each pixel from a respective 
one of the separate buffers and approximately four mil- 
lion write operations to store each band of each pixel in 
the single multiple-band buffer. Some conventional 
computer systems can store four (4) bytes in a single 
write operation and can therefore store approximately 
one million pixels in a single multiple-band buffer using 
approximately one million write operations. Because of 
the significant computer system resources required for 
such graphical image reformatting, a need persists in 
the industry for ever increasing efficiency in conversion 
of graphical images from multiple buffers of respective 
single bands of the graphical image to a single buffer of 



SUMMARY OF THE INVENTION 

s In accordance with the present invention, respec- 
tive components of two or more multiple-band pixels are 
read from each of two or more single-band graphical im- 
age buffers, each of which includes components of a 
respective band of the multiple-band pixels. Two or 
io more components of each band are stored in respective 
data words. For example, (i) two or more red compo- 
nents of two or more respective multiple-band pixels are 
read from a red buffer and are stored in a red data word, 
(ii) two or more green components of the two or more 
75 multiple-band pixels are read from a green buffer and 
are stored in a green data word, and (iii) two or more 
blue components of the two or more multiple-band pix- 
els are read from a blue buffer and are stored in a blue 
data word. Data words, which include two or more com- 
ponents corresponding to each of first and third bands 
of two or more multiple-band pixels, are merged to form 
an interleaved data word in which respective compo- 
nents of the merged data words are interleaved. For ex- 
ample, if the first and third bands of the multiple-band 
pixels are alpha and green bands, respectively, the in- 
terleaved data word includes a component representing 
the alpha band of a first multiple-band pixel followed by 
a component representing the green band of the first 
multiple-band pixel followed by a component represent- 
ing the alpha band of a second multiple-band pixel fol- 
lowed by a component representing the green band of 
the second multiple-band pixel, and so on. Data words, 
which include two or more components corresponding 
to each of second and fourth bands of the two or more 
multiple-band pixels, are merged to form a second in- 
terleaved data word in which respective components of 
the merged data words are interleaved. For example, if 
the second and fourth bands of the multiple-band pixels 
are blue and red bands, respectively, the second inter- 
leaved data word includes components representing the 
following in sequence: the blue band of the first multiple- 
band pixel, the red band of the first multiple-band pixel, 
the blue band of the second multiple-band pixel, the red 
band of the second multiple-band pixel, and so on. 

The first-mentioned interleaved data word and the 
second interleaved data word are then merged to form 
a third interleaved data word in which respective com- 
ponents of the first and second interleaved data words 
are interleaved with one another. Continuing in the 
above illustrative example, the third interleaved data 
word includes components representing the following in 
sequence: the alpha band of the first multiple-band pix- 
el, the blue band of the first multiple-band pixel, the 
green band of the first multiple -band pixel, the red band 
of the first multiple-band pixel, the alpha band of the sec- 
ond multiple-band pixel, the blue band of the second 
multiple-band pixel, the green band of the second mul- 
tiple-band pixel, the red band of the second multiple- 
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band pixel, and so on. Therefore, the third interleaved 
data word includes two or more multiple-band pixels in 
the alpha, blue, green, red format. 

In one embodiment, a single read operation reads 
eight components from each of three buffers which cor- 
respond to red, green, and blue bands, respectively, of 
a multiple-band graphical image. Of the eight compo- 
nents of each band read using the single read operation, 
four components of each band are merged to form four 
multiple-band pixels simultaneously. A single merge op- 
eration merges eight components representing alpha 
and green bands of four multiple-band pixels, and a sin- 
gle merge operation merges eight components repre- 
senting blue and red bands of thefour multiple-band pix- 
els. Two additional merge operations merge the merged 
data words to 1orm four multiple-band pixels, each of 
which includes alpha, blue, green, and red components. 
Eight multiple-band pixels, each of which is four bytes 
in length, are written to a destination buffer in four write 
operations. Thus, to convert eight pixels from single- 
band format to multiple-band format, e.g., alpha, blue, 
green, and red format, requires three read operations, 
eight merge operations, and four write operations, i.e., 
a total of fifteen operations. Using prior art techniques, 
such format conversion of three components of each of 
eight pixels to eight multiple-band pixels typically re- 
quires eight read operations for each band, i.e., twenty- 
lour read operations, and thirty-two write operations 
which total fifty-six operations. In addition, each compo- 
nent is typically moved individually into an interleaved, 
multiple-band format in a separate assignment opera- 
tion. Thus, thirty-two assignment operations are re- 
quired as well, bringing the total number of required op- 
erations to eighty-eight. As a result, the present inven- 
tion represents a significant reduction of the required 
processing resources to convert single-band pixel com- 
ponents to multiple-band pixels over prior art tech- 
niques. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of a computer system 
which includes an image processor which builds from 
separate, single-band pixel component buffers a multi- 
ple-band pixel buffer in accordance with the present in- 
vention. 

Figure 2 is a logic flow diagram illustrating the con- 
struction of a multiple-band pixel buffer from separate, 
single-band pixel component buffers by the image proc- 
essor of Figure 1 in accordance with the present inven- 
tion. 

Figure 3 is a block diagram illustrating merge oper- 
ations used by the image processor of Figure 1 to con- 
struct from single-band pixel component buffers a mul- 
tiple-band pixel buffer in accordance with the present 
invention. 

Figure 4 is a block diagram illustrating a merge op- 
eration periormed by a computer processor of Figure 1 . 



Figure 5 is a block diagram of the computer proc- 
essor of Figure 1 in greater detail. 

DETAILED DESCRIPTION 

5 

In accordance with the present invention, compo- 
nents of multiple-band pixels are read from multiple sin- 
gle-band pixel component buffers and are merged to 
form multiple-band pixels in an interleaved format sub- 
to stantially simultaneously. 

Hardware Components of the Image Processing 
System 

is To facilitate appreciation of the present invention, 
the hardware components of the graphical image refor- 
matting system are briefly described. Computer system 
100 (Figure 1) includes a processor 102 and memory 
104 which is coupled to processor 102 through a bus 

20 106. Processor 102 fetches from memory 1 04 computer 
instructions and executes the fetched computer instruc- 
tions. Processor 102 also reads data from and writes 
data to memory 1 04 and sends data and control signals 
through bus 106 to one or more computer display de- 

2S vices 120 in accordance with fetched and executed 
computer instructions. Processor 102 is described in 
greater detail below. 

Memory 104 can include any type of computer 
memory and can include, without limitation, randomly 

30 accessible memory (RAM), read-only memory (ROM), 
and storage devices which include storage media such 
as magnetic and/or optical disks. Memory 1 04 includes 
an image processor 110, which is a computer process 
executing within processor 102 from memory 104. A 

35 computer process is a collection of computer instruc- 
tions and data which collectively define a task performed 
by computer system 100. As described more completely 
below, image processor 110 (i) reads red, green, and 
blue components of pixels from buffers 1 1 2R, 1 1 2G, and 

40 112B, respectively, (ii) combines those components to 
form multiple-band pixels, and (iii) stores those multiple- 
band pixels in destination buffer 114. 

Buffers 1 1 2R, 1 1 2G, and 1 1 2B and destination buff- 
er 114 are stored in memory T04. Buffers 112R, 112G, 

45 and 1 1 2B each store data representing individual bands 
of the same pixels. For example, buffers 112R, 112G, 
and 11 2B store red, green, and blue components, re- 
spectively, of the same pixels. In other words, if a par- 
ticular component of data at a particular location within 

so buffer 112R represents a red band of a particular pixel, 
the particular component of data at the same location 
within buffers 1 1 2G and 1 1 2B represent green and blue 
bands, respectively, of the same pixel. Therefore, buff- 
ers 11 2R, 112G, and 112B collectively represent a single 
ss collection of multiple-band pixels. 

Destination buffer 114 can be any graphical image 
buffer used in graphical image processing. For example, 
destination buffer 114 can be a Z buffer which is used 
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in a conventional manner to remove hidden surfaces 
from a rendered graphical image Alternatively, destina- 
tion buffer 114 can be a frame buffer whose contents 
are immediately displayed in one of computer display 
devices 120. Each of computer display devices 120 can 
be any type of computer display device including without 
limitation a printer, a cathode ray tube (CRT), a light- 
emitting diode (LED) display, or a liquid crystal display 
(LCD). Each of computer display devices 120 receives 
from processor 102 control signals and data and, in re- 
sponse to such control signals, displays the received da- 
ta. Computer display devices 1 20, and the control there- 
of by processor 102, are conventional. 

The construction from multiple single-band buffers 
1 1 2R, 1 1 2G, and 1 1 2B of multiple-band destination buff- 
er 1 1 4 by image processor 1 1 0 is illustrated as logic flow 
diagram 200 (Figure 2). Processing according to logic 
flow diagram 200 begins with loop step 202. Loop step 
202 and next step 218 define a loop in which image proc- 
essor 110 (Figure 1) processes all of the components of 
buffers 112R, 112G, and 112G according to steps 
204-216. Eight of the multiple-band pixels represented 
collectively by buffers 11 2R, 112G, and 112B are proc- 
essed in a-single iteration of the loop defined by loop 
step 202 and next step 218. For each eight of the mul- 
tiple-band pixels, processing transfers from loop step 
202 to step 204 

In step 204, image processor 110 (Figure 1) reads 
eight components from buffer 11 2R in a single read op- 
eration. Processor 102 performs a read operation in 
which sixteen contiguous bytes of data can be read from 
memory 104. Image processor 110 invokes the read op- 
eration and causes processor 102 to perform a data 
alignment operation which shifts the read data such that 
the byte representing the first of the eight components 
of buffer 1 1 2R to be processed according to the current 
iteration of the loop defined by loop step 202 (Figure 2) 
and next 218 is aligned on an eight-byte boundary. In a 
preferred embodiment, image processor 110 (Figure 1) 
determines whether the sixteen bytes of data read in 
step 204 (Figure 2) are already aligned on an eight-byte 
boundary prior performing the data alignment operation. 
If the sixteen bytes of data are already so aligned, image 
processor 110 (Figure 1) does not perform the data 
alignment operation. 

In this illustrative embodiment, each component of 
buffers 112R, 112G, and 112B which represents a band 
of a pixel is a single byte. While. data representing eight 
pixels are retrieved substantially simultaneously, data 
representing four pixels are converted from single-band 
format to multiple-band format substantially simultane- 
ously. Thus, four contiguous bytes representing respec- 
tive pixel components from buffer 112Rare stored in da- 
ta word 312 (Figure 3) of image processor 110 (Figure 
1). Data word 312 (Figure 3) includes four partitioned 
bytes RO-3 which represent respective ones of the first 
four components read from buffer 112R Image proces- 
sor 110 (Figure 1) similarly includes in four partitioned 



bytes of a data word the second four components read 
1 rorn Cr&uffe i yg 1 ; 2Rf4 n a directly ana logous man ne r. 
Processl^ (Figure 2) to step 

206. : 

s In step 206, image processor 110 (Figure 1) reads 
eight components from buffer 112G in a single read op- 
eration and aligns the read data and stores the first four 
components in data word. 304 (Figure 3) of image proc- 
essor 110 (Figure 1) in a manner which is directly anal- 

10 ogous to that described above with respect to step 204. 
Data word 304 is directly analogous to data word 312 
and includes four partitioned bytes GO-3 which corre- 
spond to, i.e. , represent bands of the same pixels as the 
pixels whose bands are also represented by, bytes 

is ro-3. Processing transfers from step 206 (Figure 2) to 
step 208. 

In step 208, image processor 110 (Figure 1) reads 
eight components from buffer 112B in a single read op- 
erations and aligns the read data and stores the first four 

20 components in data word 310 (Figure 3) of image proc- 
essor 110 (Figure 1) in a manner which is directly anal- 
ogous to that described above with respect to step 204. 
Data word 310 is directly analogous to data words 312 
and 304 and includes four partitioned bytes B0 -3 which 

2S correspond to, i.e., represent bands of the same pixels 
as the pixels whose bands are also represented by, 
bytes R0-3. Processing transfers from step 208 (Figure 
2) to step 210. 

In this illustrative embodiment, image processor 

30 110 (Figure 1) reads components from buffers repre- 
senting red, green, and blue bands of the pixels of a 
graphical image and stores pixels having four bands in 
destination buffer 114. Therefore, image processor 110 
includes a data word 302 which includes four partitioned 

3S bytes XO-3 which represent the fourth band of four re- 
spective pixels and are initialized to have values of zero. 
In an alternative embodiment, bytes X0-3 are read from 
a fourth single-band buffer in the manner described 
above. 

40 In step 210 (Figure 2), image processor 110 merges 
bytes X0-3 and bytes G0-3 using a PMERGE operation 
306 which is performed by processor 102 (Figure 1 ) and 
is illustrated in Figure 4. Data word 402 is 32-bits in 
length and includes four partitioned bytes 402A-D. Sim- 

4S Marly, data word 404 is 32-bits in length and includes four 
partitioned bytes 404A-D. The PMERGE operation in- 
terleaves respective bytes of data words 402 and 404 
into a double data word 406 as shown. Double data word 
406 is 64 bits in length and includes eight partitioned 

so bytes 406A-H. The result of PMERGE operation 306 
(Figure 3) is double data word 308 which is 64-bits in 
length and whose eight partitioned bytes have the fol- 
lowing values: X0, GO, X1 , G1 , X2, G2, X3, and G3. Im- 
age processor 110 merges the second four bytes read 

55 from buffer 1 12G with bytes X0-3 in a directly analogous 
manner. Processing transfers from step 210 (Figure 2) 
to step 21 2. 

In step 212, image processor 110 (Figure 1) merges 
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bytes B0-3 (Figure 3) and R0 : 3 using a PMERGE oper- 
ation 314, which is directly analogous to PMERGE op- 
eration 306 described above. The result of PMERGE 
operation 314 is double data word 316 which is 64-bits 
in length and whose eight partitioned bytes have the fol- 
lowing values: BO, R0, B1 , R1, B2, R2, B3, and R3. Im- 
age processor 110 (Figure 1) also merges the second 
four bytes read from buffer 112B with the second four 
bytes read Irom buffer 1 1 2R in a directly analogous man- 
ner. Processing transfers from step 212 (Figure 2) to 
step 214. 

In step 21 4, image processor 1 1 0 (Figure 1 ) merges 
data words 30B (Figure 3) and 316 using PMERGE op- 
erations 318 and 322. Specifically, image processor 110 
(Figure 1 ) merges the upper halves 308A (Figure 3) and 
31 6 A of data words 308 and 316, respectively, using 
PMERGE operation 318 to form double data word 320. 
PMERGE operation 318 is directly analogous to 
PMERGE operations 308 and 316 described above. 
Double data word 320 is a 64-bit data word which in- 
cludes eight partitioned bytes whose respective values 
are those of bytes X0, B0, GO, R0, X1 , B1 , G1 , and R1 , 
respectively. Therefore, upper half 320A of double data 
word 320 includes bytes X0, B0, GO, and R0, which col- 
lectively represent a single four-band pixel in alpha, 
blue, green, and red format. Similarly, lower half 320B 
of double data word 320 includes bytes X1 , B1 , G1 , and 
R1, which collectively represent another single four- 
band pixel in alpha, blue, green, and red format. 

In an analogous manner, image processor 1 1 0 (Fig- 
ure 1) merges lower halves 308B (Figure 3) and 31 6B 
of double data words 308 and 316, respectively, using 
PMERGE operation 322 to form double data word 324. 
PMERGE operation 322 is directly analogous to 
PMERGE operations 306, 314, and 318 described 
above. Double data word 324 is a 64-bit data word which 
includes eight partitioned bytes whose respective val- 
ues are those of bytes X2, B2, G2, R2, X3, B3, G3, and 
R3, respectively. Therefore, upper half 324A of double 
data word 324 includes bytes X2, B2, G2, and R2, which 
collectively represent a third single four-band pixel in al- 
pha, blue, green, and red format. Similarly, lower half 
324 B of double data word 324 includes bytes X3, B3, 
G3, and R3, which collectively represent a fourth single 
four-band pixel in alpha, blue, green, and red format 

In a manner which is directly analogous to that de- 
scribed above with respect to Figure 3, image processor 
110 (Figure I) merges the second four bytes read from 
each of buffers 1 1 2B, 1 1 2G , and 1 1 2R to represent four 
more pixels in alpha, blue, green, and red format. 

Processing transfers from step 214 (Figure 2) to 
step 216 in which image processor 110 (Figure 1 ) stores 
data representing the eight pixels in alpha, blue, green, 
and red format, including data words 320 (Figure 3) and 
324, in destination buffer 114 in a single write operation. 
As described above, storage of pixels in destination 
buffer 114 can result immediately or indirectly in display 
of such pixels in one or more of computer display devic- 



es 120. From step 216 (Figure 2), processing transfers 
through next step 21 8 to loop step 202 in which the next 
eight pixels represented by buffers 112R, 112G, and 
11 2B collectively are processed according to steps 
5 204-216. Once all pixels represented by buffers 11 2R, 
11 2G, and 112B collectively have been processed ac- 
cording to the loop of loop step 202 and next step 218, 
processing according to logic flow diagram 200 com- 
pletes. 

io While it is generally described that all pixels repre- 
sented by buffers 112R (Figure 1), 112G, and 112B col- 
lectively are processed, eight pixels per iteration of the 
loop of loop step 202 (Figure 2) and next step 21 6, some 
buffers do not necessarily store pixels, of sequential 

is scanlines contiguously. Therefore, in a preferred em- 
bodiment, image processor 110 (Figure 1) processes in 
each iteration of the loop of loop step 202 (Figure 2) and 
next step 218 eight pixels of a particular scanline repre- 
sented by buffers 1 1 2R, 1 1 2G, and 1 1 2B collectively. In 

20 this preferred embodiment, image processor 110 (Fig- 
ure 1) processes each scanline of buffers 112R, 112G, 
and 112B in sequence. 

It is appreciated that scanlines of a particular graph- 
ical image represented by buffers 112R, 11 2G, and 

2S 112B sometimes has a number of pixels which is not 
evenly divisible by eight. In such circumstances, image 
processor 110 processes one, two, three, four, five, six, 
or seven pixels represented by buffers 1 1 2R, 1 1 2G, and 
112B collectively in the manner described above with 

30 respect to steps 204-21 6 while ignoring excess bytes of 
double data words 302, 304, 310, 312, 320, and 324. 
For example, if scanlines of a graphical image repre- 
sented by buffers 112R, 112B, and 112G include a 
number of pixels which is one more than an integer mul- 

35 tiple of eight, one pixel represented by buffers 112R, 
112G, and 112B is processed in the following manner 
for each scanline of buffers 112R, 112G, and 112B. 

Image processor 110 reads one pixel from each of 
buffers 112R, 112G, and 11 2B and stores the read byte 

40 at the upper byte of each of data words 312 (Figure 3), 
310, and 304, respectively, i.e., as bytes RO, GO, and 
B0, respectively. Data word 302 is set as described 
above and is not modified throughout processing ac- 
cording to logic flow diagram 200 (Figure 2). Bytes R1 

45 (Figure 3), R2, R3, GT, G2, G3, B1, B2, and B3 are ig- 
nored. PMERGE operations 306, 314, 318, and 322 are 
executed in the manner described above. As a result, 
bytes X0, B0, GO, and R0 are in upper half 320A of data 
word 320 and are stored in destination buffer 1 1 4 (Figure 

so 1) by image processor 110. Lower half 320B (Figure 3) 
of data word 320 and both halves of data word 324 are 
ignored. 

Figure 3 also illustrates merging of components of 
multiple-band pixels in a multiple-band format to sepa- 
ls rate each respective band of the multiple-band pixels 
into separate single bands. Such is illustrated, for ex- 
ample, by the following example. Data word 302 in- 
cludes bytes X0, X1, X2, and X3 which represent alpha, 
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blue, green, and red components, respectively, of a sin- 
gle pixel. In a directly analogous manner, respective 
bands of second, third, and fourth pixels are presented 
by respective bytes of data words 310, 304, and 312. 
Thus, bytes X0, BO, GO, and R0 represent alpha com- 
ponents of first, second, third, and fourth pixels, respec- 
tively. Similarly, (i) bytes X1, B1, G1, and R1 represent 
blue components of first, second, third, and fourth pix- 
els, respectively; (ii) bytes X2, B2, G2, and R2 represent 
green components of first, second, third, and fourth pix- 
els, respectively; and (iii) bytes X3, B3, G3, and R3 rep- 
resent red components of first, second, third, and fourth 
pixels, respectively. By operation of PMERGE opera- 
tions 306, 314, 318, and 322 in the manner described 
above, data word 320A includes alpha ccmponents of 
the four pixels, data word 320B includes blue compo- 
nents of the four pixels, data word 324A includes green 
components of the four pixels, and data word 324B in- 
cludes red components of the four pixels. 

Processor 102 

Processor 102 is shown in greater detail in Figure 
5 and is described briefly herein and more completely 
in United States patent application serial number 
08/236,572 by Timothy J. Van Hook, Leslie Dean Kohn, 
and Robert Yung, filed April 29,1994 and entitled "A 
Central Processing Unit with Integrated Graphics Func- 
tions" (the "572 application) which is incorporated in its 
entirety herein by reference. Processor 102 includes a 
prefetch and dispatch unit (PDU) 46, an instruction 
cache 40, an integer execution unit (IEU) 30, an integer 
register file 36, a floating point unit (FPU) 26, a floating 
point register file 38, and a graphics execution unit 
(GRU) 28, coupled to each other as shown. Additionally, 
processor 102 includes two memory management units 
(IMMU & DMMU) & 44a-44b, and a load and store unit 
(LSU) 48, which in turn includes data cache 120, cou- 
pled to each other and the previously described ele- 
ments as shown. Together, the components of proces- 
sor 102 fetch, dispatch, execute, and save execution re- 
sults of computer instructions, e.g., computer instruc- 
tions of image processor 110 (Figure 1), in a pipelined 
manner. 

PDU 46 (Figure 5) fetches instructions from mem- 
ory 104 (Figure 1)and dispatches the instructions to IEU 
30 (Figure 5), FPU 26, GRU 28, and LSU 48 accordingly. 
Prefetched instructions are stored in instruction cache 
40. IEU 30, FPU 26, and GRU 28 perform integer, float- 
ing point, and graphics operations, respectively. In gen- 
eral, the integer operands and results are stored in in- 
teger register file 36, whereas the floating point and 
graphics operands and results are stored in floating 
point register file 38. Additionally, IEU 30 also performs 
a number of graphics operations, and appends address 
space identifiers (ASI) to addresses of load/store in- 
structions for LSU 48, identifying the address spaces 
being accessed. LSU 48 generates addresses for all 



load and store operations. The LSU 48 also supports a 
number of load and store operations, specifically de- 
signed for graphics data. Memory references are made 
in virtual addresses. MMUs 44a-44b map virtual ad- 
5 dresses to physical addresses. 

PDU 46, IEU 30, FPU 26, integer and floating point 
register files 36 and 38, MMUs 44a-44b, and LSU 48 
can be coupled to one another in any of number of con- 
figurations as described more completely in the '572 ap- 
10 plication. As described more completely in the '572 ap- 
plication with respect to Figures 8a-8d thereof, GRU 28 
performs a number of distinct partitioned multiplication 
operations and partitioned addition operations. Various 
partitioned operations used by image processor 110 
(Figure 1) are described more completely below. 

As described above, processor 102 includes four 
(4) separate processing units, i.e., LSU 48, IEU 30, FPU 
26, and GRU 28. Each of these processing units is de- 
scribed more completely in the '572 application. These 
20 processing units operate in parallel and can each exe- 
cute a respective computer instruction while others of 
the processing units executes a different computer in- 
struction. GRU 28 executes the merge operations de- 
scribed above. 

2S in one embodiment, processor 102 is the Ul- 
traSPARC processor available from SPARC Internation- 
al, Inc., and computer system 100 (Figure 1) is the Ul- 
traSPARC station available from Sun Microsystems, Inc. 
of Mountain View, California. Sun, Sun Microsystems, 

30 and the Sun Logo are trademarks or registered trade- 
marks of Sun Microsystems, Inc. in the United States 
and other countries. All SPARC trademarks are used 
under license and are trademarks of SPARC Interna- 
tional, Inc. in the United States and other countries. 

35 Products bearing SPARC trademarks are based upon 
an architecture developed by Sun Microsystems, Inc. 



Claims 

40 

1. A method for constructing at least one multiple- 
band pixels from multiple single-band pixel compo- 
nents, the method comprising: 

45 (a) combining a first data word including at least 

two single-band pixel components represent- 
ing a first band of respective multiple-band pix- 
els with a second data word including at least 
two single-band pixel components represent- 
so ing a third band of the respective multiple-band 
pixels to form a first interleaved data word, 
which includes interleaved single-band pixel 
components of the first and second data words; 
(b) combining a third data word including at 
55 least two single-band pixel components repre- 
senting a second band of the respective multi- 
ple-band pixels with a fourth data word includ- 
ing at least two single-band pixel components 
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representing a fourth band of the respective 
multiple-band pixels to form a second inter- 
leaved data word, which includes interleaved 
single-band pixel components of the third and 
fourth data words; and 5 
(c) combining the first and second interleaved 
data words to form a third interleaved data word 
which includes at least the following single- 
band pixel components in the following exem- 
plary order: 10 

the first of the at least two single-band pixel 

components of the first data word; 

the first of the at least two single-band pixel 

components of the data third word; 15 

the first of the at least two single-band pixel 

components of the second data word; 

the first of the at least two single-band pixel 

components of the fourth data word; 

the second of the at least two single-band 20 

pixel components of the first data word; 

the second of the at least two single-band 

pixel components of the third data word; 

the second of the at least two single-band 

pixel components of the second data word; 2s 

and 

the second of the at least two single-band 
pixel components of the fourth data word. 

2. The method of Claim 1 wherein steps (a) and (b) 30 
are performed by a computer processor; 

further wherein each of steps (a) and (b) is 
performed in a single instruction cycle of the com- 
puter processor. 

36 

3. The method of Claim 1 further comprising; 

reading the second, third, and fourth data 
words from first, second, and third buffers, respec- 
tively, which are stored in a memory of the compu- 
ter. *° 

4. The method of Claim 3 further comprising: 

assigning to the first data word a predeter- 
mined value. 

46 

5. The method of Claim I further comprising: 

storing the third interleaved data word in a 
destination buffer in a memory of the computer. 

6. The method of Claim 1 wherein the first, second, s° 
third, and fourth bands of the multiple-band pixels 

are alpha, blue, green, and red bands, respectively. 

7. A computer program product which includes a com- 
puter usable medium having computable readable 55 
code embodied therein for constructing at least one 
multiple-band pixels from multiple single-band pixel 
components, the computer readable code compris- 



ing: 

(a) a first merge module configured to combine 
a first data word including at least two single- 
band pixel components representing a first 
band of respective multiple-band pixels with a 
second data word including at least two single- 
band pixel components representing a third 
band of the respective multiple-band pixels to 
form a first interleaved data word, which in- 
cludes interleaved single-band pixel compo- 
nents of the first and second data words; 

(b) a second merge module configured to com- 
bine a third data word including at least two sin- 
gle-band pixel components representing a sec- 
ond band of the respective multiple-band pixels 
with a fourth data word including at least two 
single-band pixel components representing a 
fourth band of the respective multiple-band pix- 
els to form a second interleaved data word, 
which includes interleaved single-band pixel 
components of the third and fourth data words; 
and 

(c) a third merge module operatrvely coupled to 
the first and second merge modules and con- 
figured to combine the first and second inter- 
leaved data words to form a third interleaved 
data word which includes at least the following 
single-band pixel components in the following 
exemplary order: 

the first of the at least two single-band pixel 

components of the first data word; 

the first of the at least two single-band pixel 

components of the third data word; 

the first of the at least two single-band pixel 

components of the second data word; 

the first of the at least two single-band pixel 

components of the fourth data word; 

the second of the at least two single-band 

pixel components of the first data word; 

the second of the at least two single-band 

pixel components of the third data word; 

the second of the at least two single-band 

pixel components of the second data word; 

and 

the second of the at least two single-band 
pixel components of the fourth data word. 

8. The computer program product of Claim 7 wherein 
the first merge module is further configured to com- 
bine the first and second data words in a single in- 
struction cycle of a computer processor. 

9. The computer program product of Claim 7 wherein 
the second merge module is further configured to 
combine the third and fourth data words in a single 
instruction cycle of a computer processor. 
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10. The computer program product of Claim 7 further 
comprising: 

a data reading module operatively coupled to 
the first and second merge modules and configured 
to read the second, third, and fourth data words 
from first, second, and third buffers, respectively, 
which are stored in a memory of the computer. 

11. The computer program product of Claim 10 further 
comprising: 

a first data word initialization module opera- 
tively coupled to the first merge module and config- 
ured to assign to the first data word a predetermined 
value. 

12. The computer program product of Claim 7 further 
comprising: 

a data writing module operatively coupled to 
the third merge module and configured to store the 
third interleaved data word in a destination buffer in 
a memory of the computer. 

13. The computer program product of Claim 7 wherein 
the first, second, third, and fourth bands of. the mul- 
tiple-band pixels are alpha, blue, green, and red 
bands, respectively, 

14. An image processor for constructing at least one 
multiple-band pixels from multiple single-band pixel 
components, the image processor comprising: 

(a) a first merge module configured to combine 
a first data word including at least two single- 
band pixel components representing a first 
band of respective multiple-band pixels with a 
second data word including at least two single- 
band pixel components representing a third 
band of the respective multiple-band pixels to 
form a first interleaved data word, which in- 
cludes interleaved single-band pixel compo- 
nents of the first and second data words; 

(b) a second merge module configured to com- 
bine a third data word including at least two sin- 
gle-band pixel components representing a sec- 
ond band of the respective multiple-band pixels 
with a fourth data word including at least two 
single-band pixel components representing a 
fourth band of the respective multiple-band pix- 
els to form a second interleaved data word, 
which includes interleaved single-band pixel 
components of the third and fourth data words; 
and 

(c) a third merge module operatively coupled to 
the first and second merge modules and con- 
figured to combine the first and second inter- 
leaved data words to form a third interleaved 
data word which includes at least the following 
single-band pixel components in the following 



exemplary order: 

the first of the at least two single-band pixel 
components of the first data word; 

s the first of the at least two single-band pixel 

components of the third data word; 
the first of the at least two single-band pixel 
components of the second data word; 
the first of the at least two single-band pixel 

io components of the fourth data word; 

the second of the at least two single-band 
pixel components of the first data word; 
the second of the at least two single-band 
pixel components of the third data word; 

is the second of the at least two single-band 

pixel components of the second data word; 
and the second of the at least two single- 
band pixel components of the fourth data 
word. 
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15. The image processor of Claim 14 wherein the first 
merge module is further configured to combine the 
first and second data words in a single instruction 
cycle of a computer processor. 

16. The image processor of Claim 14 wherein the sec- 
ond merge module is further configured to combine 
the third and fourth data words in a single instruction 
cycle of a computer processor. 

17. The image processor of Claim 14 further compris- 
ing: 

a data reading module operatively coupled to 
the first and second merge modules and configured 
to read the second, third, and fourth data words 
from first, second : and third buffers, respectively, 
stored in a memory of the computer. 

18. The image processor of Claim 17 further compris- 
ing: 

a first data word initialization module opera- 
tively coupled to the first merge module and config- 
ured to assign to the first data word a predetermined 
value. 



19. The image processor of Claim 14 further compris- 
ing: 

a data writing module operatively coupled to 
the third merge module and configured to store the 
so third interleaved data word in a destination buffer in 
a memory of the computer. 

20. The image processor of Claim 14 wherein the first, 
second, third, and fourth bands of the multiple-band 

ss pixels are alpha, blue, green, and red bands, re- 
spectively. 

21. A computer system. comprising: 
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a memory; 

a computer processor operatively coupled to 
the memory; and 

an image processor which is stored in the mem- 
ory and which includes at least one computer s 
instructions which are executed within the com- 
puter processor to construct at least one multi- 
ple-band pixels from multiple single-band pixel 
components, the image processor including: 

10 

(a) a first merge module configured to com- 
bine a first data word including at least two 
single-band pixel components represent- 
ing a first band ol respective multiple-band 
pixels with a second data word including at is 
least two single-band pixel components 
representing a third band of the respective 
multiple-band pixels to form a first inter- 
leaved data word, which includes inter- 
leaved single-band pixel components of 20 
the first and second data words; 

(b) a second merge module configured to 
combine a third data word including at least 
two single-band pixel components repre- 
senting a second band of the respective 2s 
multiple-band pixels with a fourth data 
word including at least two single-band pix- 
el components representing a fourth band 

of the respective multiple-band pixels to 
form a second interleaved data word, 30 
which includes interleaved single-band 
pixel components of the third and fourth da- 
ta words; and 

(c) a third merge module operatively cou- 
pled to the first and second merge modules 35 
and configured to combine the first and 
second interleaved data words to form a 
third interleaved data word which includes 

at least the following single-band pixel 
components in the following exemplary *o 
order; . 

the first of the at least two single-band 
pixel components of the first data 
word; 45 
the first of the at least two single-band 
pixel components of the third data 
word; 

the first of the at least two single-band 
pixel components of the second data so 
word; 

the first of the at least two single -band 
pixel components of the fourth data 
word; 

the second of the at least two single- 55 
band pixel components of the first data 
word; 

the second of the at least two single- 



band pixel components of the third da- 
ta word; 

the second of the at least two single- 
band pixel components of the second 
data word; and 

the second of the at least two single- 
band pixel components of the fourth 
data word. 

22. The computer system of Claim 21 wherein the first 
merge module is further configured to combine the 
first and second data words in a single instruction 
cycle of a computer processor. 

23. The computer system of Claim 21 wherein the sec- 
ond merge module is further configured to combine 
the third and fourth data words in a single instruction 
cycle of a computer processor. 

24. The computer system of Claim 21 wherein the im- 
age processor further comprises; 

a data reading module operatively coupled to 
the first and second merge modules and configured 
to read the second, third, and fourth data words 
from first, second, and third buffers, respectively, 
stored in a memory of the computer. 

25. The computer system of Claim 24 wherein the im- 
age processor further comprises: 

a first data word initialization module opera- 
tively coupled to the first merge module and config- 
ured to assign to the first data word a predetermined 
vaiue. 

26. The computer system of Claim 21 wherein the im- 
age processor further comprises: 

a data writing module operatively coupled to 
the third merge module and configured to store the 
third interleaved data word in a destination buffer in 
a memory of the computer. 

The computer system of Claim 21 wherein the first, 
second, third, and fourth bands of the multiple-band 
pixels are alpha, blue, green, and red bands, re- 
spectively. 

A system for distributing code (i) which is stored on 
a computer-readable medium, (ii) which is execut- 
able by a computer, and (iii) which includes at least 
one module, each of which in turn is configured to 
carry out at least one function to be executed by the 
computer, the system comprising: 

(a) a first merge module configured to combine 
a first data word including at least two single- 
band pixel components representing a first 
band of respective multiple-band pixels with a 
second data word including at least two single- 
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band pixel components representing a third 
band of the respective multiple-band pixels to 
form a first interleaved data word, which in- 
cludes interleaved single-band pixel compo- 
nents of the first and second data words; 5 

(b) a second merge module configured to com- 
bine a third data word including at least two sin- 
gle-band pixel components representing a sec- 
ond band of the respective multiple-band pixels 
with a fourth data word including at least two 10 
single-band pixel components representing a 
fourth band of the respective multiple-band pix- 
els to form a second interleaved data word, 
which includes interleaved single-band pixel 
components of the third and fourth data words; 
and 

(c) a third merge module operatively coupled to 
the first and second merge modules and con- 
figured to combine the first and second inter- 
leaved data words to form a third interleaved 20 
data word which includes at least the following 
single-band pixel components in the following 
exemplary order: 

the first of the at least two single-band pixel 25 

components of the first data word; 

the first of the at least two single-band pixel 

components of the third data word; 

the first of the at least two single-band pixel 

components of the second data word; 30 

the first of the at least two single-band pixel 

components of the fourth data word; 

the second of the at least two single-band 

pixel components of the first data word; 

the second of the at least two single-band 35 

pixel components of the third data word; 

the second of the at least two single-band 

pixel components of the second data word; 

and 

the second of the at least two single-band 40 
pixel components of the fourth data word. 

29. The system of Claim 28 wherein the first merge 
module is further configured to combine the first and 
second data words in a single instruction cycle of a «s 
computer processor. 



32. The system of Claim 31 further comprising: 

a first data word initialization module opera- 
tively coupled to the first merge module and config- 
ured to assign to the first data word a predetermined 
value. 

33. The system of Claim 28 further comprising: 

a data writing module operatively coupled to 
the third merge module and configured to store the 
third interleaved data word in a destination buffer in 
a memory of the computer. 

34. The system of Claim 28 wherein the first, second, 
third, and fourth bands of the multiple-band pixels 
are alpha, blue, green, and red bands, respectively. 



30. The system of Claim 28 wherein the second merge 
module is further configured to combine the third 
and fourth data words in a single instruction cycle 
of a computer processor. 



so 



31. The system of Claim 28 further comprising: 

a data reading module operatively coupled to 
the first and second merge modules and configured 
to read the second, third, and fourth data words 
from first, second, and third buffers, respectively, 
stored in a memory of the computer. 
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(54) Storage format conversion of computer graphics 



(57) An image processor converts single-band pixel 
components, each of which represents a single band of 
a multiple-band pixel, to multiple-band pixels. Data 
words, which include multiple single-band pixel compo- 
nents corresponding to first and third bands cf two or 
more multiple-band pixels, are merged to form an inter- 
leaved data word in which respective single-band pixel 
components of the merged data words are interleaved. 
Data words, which include multiple single-band pixel 
components corresponding to second and fourth bands 
of the two or more multiple-band pixels, are merged to 
form a second interleaved data word in which respective 
single-band pixel components of the merged data words 
are interleaved. The first-mentioned interleaved data 
word and the second interleaved data word are then 
merged to form a third interleaved data word in which 
respectively single-band pixel components of the first 
and second interleaved data word are interleaved within 
one another. In one embodiment, a single read opera- 
tion reads foursingte-band pixel components from each 
of three buffers which correspond to red, green, and 
blue bands, respectively, of a multiple-band graphical 
image. A single merge operation merges eight single- 
band pixel components representing alpha and green 
bands of four multiple-band pixels, and a single merge 
jeration merges eight single-band pixel components 
senting blue and red bands of four multiple-band 



pixels. Two merge operations merge the respective 
merged data words to form four multiple-band pixels, 
each of which includes alpha : blue, green, and red com- 
ponents. The four multiple-band pixels are written to a 
destination buffer in four write operations. 
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